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Abstract 



Exactly solvable models of linear aggregation have been known since Ising's seminal one- 
dimensional model. This model is defined by a unique nearest-neighbour bond strength that 
is independent of the length of the cluster; known as isodesmic symmetry. Linear aggregation 
in real systems has often been associated with broken isodesmic symmetry. Here we show 
that important examples can be mapped to a class of one-dimensional models that are also 
exactly solvable. 
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A model has isodesmic symmetry if every member of the infinite set of cluster equilibria 
[1] + [n — 1] -v^ [n] define the same equilibrium 'constant', as is appropriate to linear aggre- 
gation (self-assembly) where the energy required to break a chain aggregate is independent 
of the position of the break and the length of the chain. The one dimensional (ID) Ising 
P and equivalent Lattice-Gas (LG) models are isodesmic everywhere in phase-space. We 
shall consider the generalisation where the first member of the set only, [1] + [1] <^ [2], 
has a different equilibrium constant to all the rest. When treated phenomenologically with 
the "law of mass action" as an implicit solvent model this is a well-known problem in the 
physical chemistry of aggregation [2] and is also the basis of current understanding of the 
self-assembly of peptide tapes involved in protein miss-folding diseases such as Alzheimer's 
dementia [3]. 

Let us specify a ID LG hamiltonian with two energies; namely, an energy e m that an 
isolated monomer [1] must take up in the process of bonding to a cluster, and an attractive 
nearest-neighbour energy e + e m for each bond within a cluster of length n > 1. Then, 
when a monomer joins a cluster [n > 1] the change in energy involved is e, but when two 
monomers combine to form a dimer the gain in attractive energy is only e — e m . Let us use the 
exponential factor r = e^ € to represent a temperature field (j3 denotes inverse temperature in 
units of Boltzmann's constant) and it is convenient to define 7 = e m /e. A direct route to the 
exact equation of state and cluster distribution is to apply Widom's potential distribution 
theorem (PDT), 

f = YTp = z(e~^) )c , (1) 

where z = denotes the monomer activity (/i is chemical potential), p is the density in 
LG units (probability of occupancy of a single cell), the subscript c indicates a conditional 
average (there is an empty cell at position x enforced by the 1/(1 — p) factor on the left 
side of the equation) and ip denotes the change in energy that would arise if a monomer 
were to be inserted into this empty cell. The evaluation of this average (or ratio of partition 
functions) can be broken down into insertions into a finite set of specific clusters; let us use 
p w , fw for occupancy next to an empty cell, pi, fi for occupancy next to a monomer isolated 
on the other side and p a , f a for occupancy next to the end of an fixed aggregate (needed 
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below to evaluate the cluster distribution). Figure 1 enumerates all the possibilities required 
for our model, together with the corresponding values of ip. One can then use conditional 
probability to evaluate (0Q) by inspection (this can be done in any order but here I shall start 
at the insertion cell and move right and left): 

/ = z [(1 - p w ) 2 + 2(1 - p w )p w {l - pijr 1 ^ + 2(1 - p w )p w piT 
+ Plil- Pifr 2 -" + 2/4(1 - Pl ) Pl r 2 + p^r 2 + 7 } 
= z {(1 - p w f + 2(1 - p w ) p w (l - pi)r 1-7 + p w p 



(2) 



+ T> 



PwO- ~ Pi)r 7 + p w piT } . 



(3) 



To evaluate f w , fi (and f a ), one simply applies the PDT to insertions in the presence of the 
fixed clusters at the left of the insertion cells depicted in Fig. 2; each case is associated with 
three non-degenerate states. From Fig. 2a we have 



f w =Z {(1 - p w ) + p w {l - PljT 1 7 + p w piT } , 

while from Fig. 2b and Fig. 2c, respectively, 
h = far' 7 , 

fa = z {(1 - p w )r + p w (l - p{)T 2 + p w piT 2+1 } . 
Inserting into leads to a simplification: 

f fw , / -y i \ fw \fz 

To evaluate f w (T,z,j) and hence also obtain /, fi and f a , one can solve 
simultaneous equations; e.g. 

/«,(! + /«,)(! + = z(l + h + fwr 1 ^ + hf w r) 

fwd + f W )T-f\(l + f w ) = Z{T-T 1 -^). 

The solution yields 

/i(l + /i) = Ur^ + fir) , 
fM + fw) = z(l + h) , 

together with a cubic equation of state for f w : 

z \r - t 1 -') = z(l + f w )(l + f w r) - f w {l + f w ) 2 . 



(4) 

(5) 
(6) 

(7) 

as a pair of 

(8) 
(9) 

(10) 
(11) 

(12) 



Although the general mathematical solution for one-dimensional models is invariably a cubic, 
for short-range models in equilibrium there must never be more than one physical solution 
in order to preserve the absence of phase transitions in one-dimension [5|. For our non- 
isodesmic model we see that this arises because ( |T2|) is also a quadratic for the activity, 
which only ever has one physical solution: 



2z(r - r 1 " 7 ) = (! + /« 



1 + f w r - V(l + M 2 - - r 



(13) 



In this version of the equation of state the variables r and f w are a field and a density, 
respectively. One can transform the equation of state into p(r, p) by using f w as an implicit 
variable varying between zero and infinity in both equations fll3p and (J7|). The occupation 
variables f% and / a similarly follow from fllip and (151) . respectively. Note that there is no re- 
gion of phase space associated with imaginary solutions contained within the above analysis. 



The entire cluster distribution is also defined by the above occupation probabilities: 

[1] = (l- p ) Pw (l- Pl ) 

= (l-p)(l- Pw )p aT - 1 , (14) 

[n>l] = (1- p)p wPl p n a ~ 2 {l- p a ) 

= Wr-Vr 1 - (15) 

Here, I have chosen to build the clusters in strict order of left to right and note that an 
isolated monomer is actually a cluster of two empty sites enclosing an occupied site. Note 
also that to obtain the second line of (JHJ) one requires eqn. ffTUj) . It is straightforward to 
directly check the self-consistency of the PDT by showing that (I7j) is also the solution of the 
sum rule 

P=[l] + £ n[n] . (16) 

n>l 

The clusters [n > 1] satisfy the isodesmic exponential distribution [n > 1] = [l]r~ 7 p™ -1 , 
renormalized by a factor r -7 , and thus have an average cluster length 

N t ^> lU } n } =2 + f a = 2 + f^ . (17) 

The only effect of the broken isodesmic symmetry is to cause the aggregation distribution 
to step down a factor of r~ 7 between isolated monomers (n = 1) and dimers (n = 2). In 
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the case of peptide tape formation, the broken isodesmic symmetry (7 > 0) results from the 
energy barrier that must be overcome by a helical monomer when unwinding in the process 
of joining a beta-sheet tape. Experimental procedures exist that can directly measure the 
percentage of peptide in a beta-sheet confirmation, [3]. This defines the order parameter 

These results are essentially identical to the free-energy minimization proposed by Aggeli et 
al [3]. The only mathematical difference is that the LG model has an additional factor of 
(1 — p)(l — p w ) multiplying each cluster probability density [n], to allow for arbitrarily high 
concentration. From the geometry of typical peptide tapes [H] one notes that a monomer 
concentration of 10 -6 molar maps roughly to p = 10~ 6 in LG units. Since the experimental 
data for <pt{p) in Fig. 6b of Aggeli et al is at concentrations less than 500 micromolar, this 
factor is completely irrelevant. Accordingly, we have identified an exactly solvable LG model 
with identical aggregation statistics to a known phenomenological analysis that has been 
successfully fitted to experimental data. The data imply a value of 7 around 0.2. There 
is however one physical difference with the free-energy analysis adopted in ref.[3]; namely, 
instead of normalising by the volume of a monomer (as is automatic in the context of a ID 
LG model) the authors choose a volume approximately 1000 times smaller. From ( fl4]fT5|) 
this is equivalent to increasing the values of both r and r 1+7 by 1000, which is not suffi- 
ciently inconsistent at 7 = 0.2 to noticeably affect the fit to the data. However, it does 
result in a significant renormalization of the predicted binding energy lnr associated with 
the self-assembly. Aggeli et al describe their small volume as characterising the movement 
of monomers within a tape and thus might be regarded as a rough correction for the entropy 
cost associated with forming a linear aggregate in a 3D solution, in contrast to a ID model. 



By breaking the isodesmic symmetry with a non-zero value of 7 the self-assembly of 
dimers is suppressed to higher concentrations, after which the rest of the cluster distribution 
kicks in. This induces a sigmoidal shape to 4>t(p) just before the percentage of beta-sheet 
self-assembly reaches 50%: 

f a = V^+T-1 ; fa = 1/2. (19) 

There is no discontinuous jump in the order parameter <p t (no van der Waals 'loop' in p(p)), 
since as already discussed there is no region of two-phase coexistence for these short-range ID 
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models. This behaviour is reminiscent of micelle formation; since the 'micelles' are tapes, the 
concentration at which this rapid increase in linear aggregation takes place can be denoted 
a 'etc'. It is therefore of some interest that we have identified a class of exactly solvable 
ID models that display the phenomenon of a etc. These broken isodesmic LG models are 
defined explicitly by the hamiltonian 

H = -(e + e m )X>t i+1 " em£*i(l - - U+i) ]U = 0,1. (20) 

i i 

A non-zero value of 7 is therefore mathematically equivalent to the introduction of a three- 
body interaction. 
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FIGURE CAPTIONS 



FIG. 1. Insertion energies ip f° r evaluating the equation of state of a ID non-isodesmic LG 
model from the PDT (Tj[|). Single-hatched cells denote the constraint that a cell be empty, 
while double-hatched cells are constrained to be occupied. The insertion cell is constrained 
to be empty but is identified by leaving it unshaded. All remaining cells (not shown) have 
no constraints. Factors of 2X arise from ID symmetry about the insertion cell. 

FIG. 2. Insertion energies tp required when using the PDT (Tj[|) to evaluate probabilities of 
occupancy directly to the right of constrained cells or walls, defining the cluster distribution 
of a ID non-isodesmic LG model, (a) Empty wall: p w ; (b) Monomer wall: p\\ (c) Cluster 
wall: p a . Single-hatched cells denote the constraint that a cell be empty, while double- 
hatched cells are constrained to be occupied. The insertion cell is constrained to be empty 
but is identified by leaving it unshaded. All remaining cells (not shown) have no constraints. 
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CONTENTS PAGE 

H = — (e + e m ) J2i UU+i — £m J2i — — ; *i = 0, 1. 

Broken isodesmic symmetry can be an important aspect of linear aggregation in solution. 
It is shown that experimental data on the self-assembly of peptide tapes can be mapped to 
exactly solvable one-dimensional models that incorporate this broken symmetry. 
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